-
Notifications
You must be signed in to change notification settings - Fork 188
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
explicitly adding docs for destination item size control #2118
Conversation
✅ Deploy Preview for dlt-hub-docs ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
the whole section looks confusing for me, I would make it as a note or info admonition in the Load section.
I would rephrase it as follows:
By enabling file rotation and adjusting the
file_max_items
andfile_max_bytes
settings, you can control the size and number of data chunks sent to the destination database. This can be particularly useful when dealing with large tables with millions of records, as it allows you to manage memory constraints on the database server and enhance performance.
@@ -50,6 +50,8 @@ Below, we set files to rotate after 100,000 items written or when the filesize e | |||
|
|||
<!--@@@DLT_SNIPPET ./performance_snippets/toml-snippets.toml::file_size_toml--> | |||
|
|||
### Controlling destination items sizes | |||
As mentioned above, `dlt` uses the the same files created between **normalize** and **load** for loading to the destination. You can therefore control the destination items' sizes by controlling the maximum single file size or enabling rotation at the **normalize** stage as demonstrated in the snippet above. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As mentioned above, `dlt` uses the the same files created between **normalize** and **load** for loading to the destination. You can therefore control the destination items' sizes by controlling the maximum single file size or enabling rotation at the **normalize** stage as demonstrated in the snippet above. | |
As mentioned above, `dlt` uses the same files created between **normalize** and **load** steps for loading data to the destination. You can therefore control the destination items' sizes by controlling the maximum single file size or enabling rotation at the **normalize** stage as demonstrated in the snippet above. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Good! Thank you Hulma <3
Description
Adding explicit documentation on how to control the destination item sizes. The files created in normalize stage are used for loading as well so controlling normalize file sizes (by setting max file size or enabling rotation) should control destination items sizes too. Currently, the bot does not pick it up as it's not explicit.
Related Issues
Additional Context